hi this is debrupa nandi today is my second day with git

studies

observational - 1. collect data in a way that does not directly interfere with how the data arise(“observe”) 2. only establish an association 3. retrospective uses past data 4. prospective - data are collected throughout the study

experiment- randomly assign subjects to treatments

Confounding variables - extraneous variables that affect both the explanatory and the response variable and that make it seem like there is a relationship between them

Correlation does not imply causation Observational staements help make us make correlation statements Experiments help us infer causation

In statistics, a confounder is a variable that influences both the dependent variable and independent variable, causing a spurious association. Confounding is a causal concept, and as such, cannot be described in terms of correlations or associations.

library(dplyr)
library(ggplot2)
library(statsr)
data(arbuthnot)
View(arbuthnot)

library(dplyr)
library(ggplot2)
library(statsr)

data(present)
View(present)

present <- present %>%
  mutate(total = boys + girls)
prop_boys <- present$boys/present$total
present <- present %>%
  mutate(prop_boys)
ggplot(data = present, aes(x = year, y = prop_boys)) +
  geom_line() + geom_point() + geom_smooth()

present <- present %>%
  mutate(more_boys = present$boys > present$girls)
View(present)
present <- present %>%
  mutate(prop_boy_girl = boys / girls)
View(present)
ggplot(data = present, aes(x = year, y = prop_boy_girl)) +
  geom_line() + geom_point() + geom_smooth()

present <- present %>%
arrange(desc(total))

Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, such as estimating location, scale, and regression parameters.

The response variable is the focus of a question in a study or experiment. An explanatory variable is one that explains changes in that variable. It can be anything that might affect the response variable. Let’s say you’re trying to figure out if chemo or anti-estrogen treatment is better procedure for breast cancer patients. The question is: which procedure prolongs life more? And so survival time is the response variable. The type of therapy given is the explanatory variable; it may or may not affect the response variable. In this example, we have only one explanatory variable: type of treatment. In real life you would have several more explanatory variables, including: age, health, weight and other lifestyle factors.

LS0tDQp0aXRsZTogIkludHJvZHVjdGlvbiB0byBwcm9iYWJpbGl0eSBhbmQgZGF0YSINCmF1dGhvcjogIkRlYnJ1cGEgTmFuZGkiDQpvdXRwdXQ6IGh0bWxfbm90ZWJvb2sNCi0tLQ0KDQpoaSB0aGlzIGlzIGRlYnJ1cGEgbmFuZGkNCioqdG9kYXkgaXMgbXkgc2Vjb25kIGRheSB3aXRoIGdpdCoqDQoNCg0KDQojIHN0dWRpZXMNCioqb2JzZXJ2YXRpb25hbCAtICoqDQoxLiBjb2xsZWN0IGRhdGEgaW4gYSB3YXkgdGhhdCBkb2VzIG5vdCBkaXJlY3RseSBpbnRlcmZlcmUgd2l0aCBob3cgdGhlIGRhdGEgYXJpc2UoIm9ic2VydmUiKQ0KMi4gb25seSBlc3RhYmxpc2ggYW4gYXNzb2NpYXRpb24NCjMuIHJldHJvc3BlY3RpdmUgdXNlcyBwYXN0IGRhdGENCjQuIHByb3NwZWN0aXZlIC0gZGF0YSBhcmUgY29sbGVjdGVkIHRocm91Z2hvdXQgdGhlIHN0dWR5DQoNCioqZXhwZXJpbWVudCoqLSByYW5kb21seSBhc3NpZ24gc3ViamVjdHMgdG8gdHJlYXRtZW50cw0KDQpDb25mb3VuZGluZyB2YXJpYWJsZXMgLSBleHRyYW5lb3VzIHZhcmlhYmxlcyB0aGF0IGFmZmVjdCBib3RoIHRoZSBleHBsYW5hdG9yeSBhbmQgdGhlIHJlc3BvbnNlIHZhcmlhYmxlIGFuZCB0aGF0IG1ha2UgaXQgc2VlbSBsaWtlIHRoZXJlIGlzIGEgcmVsYXRpb25zaGlwIGJldHdlZW4gdGhlbQ0KDQoqKkNvcnJlbGF0aW9uIGRvZXMgbm90IGltcGx5IGNhdXNhdGlvbioqDQoqKk9ic2VydmF0aW9uYWwgc3RhZW1lbnRzIGhlbHAgbWFrZSB1cyBtYWtlIGNvcnJlbGF0aW9uIHN0YXRlbWVudHMqKg0KKipFeHBlcmltZW50cyBoZWxwIHVzIGluZmVyIGNhdXNhdGlvbioqDQoNCiFbXShpbWFnZXNfZm9yX3IvQW5ub3RhdGlvbiAyMDE5LTA4LTIxIDEyMzExMS5wbmcpDQoNCg0KDQohW10oaW1hZ2VzX2Zvcl9yL0Fubm90YXRpb24gMjAxOS0wOC0yMSAxMjM1NTYucG5nKQ0KDQoNCg0KIVtdKGltYWdlc19mb3Jfci9Bbm5vdGF0aW9uIDIwMTktMDgtMjEgMTIzODIwLnBuZykNCg0KIVtdKGltYWdlc19mb3Jfci9Bbm5vdGF0aW9uIDIwMTktMDgtMjEgMTI0MTA1LnBuZykNCg0KIVtdKGltYWdlc19mb3Jfci9Bbm5vdGF0aW9uIDIwMTktMDgtMjEgMTI0NDU2LnBuZykNCg0KDQoNCiFbXShpbWFnZXNfZm9yX3IvQW5ub3RhdGlvbiAyMDE5LTA4LTIxIDEyNTAzNC5wbmcpDQoNCg0KIVtdKGltYWdlc19mb3Jfci9Bbm5vdGF0aW9uIDIwMTktMDgtMjEgMTI1MjI3LnBuZykNCg0KDQoNCiFbXShpbWFnZXNfZm9yX3IvQW5ub3RhdGlvbiAyMDE5LTA4LTIxIDEzMDYxMS5wbmcpDQoNCiFbXShpbWFnZXNfZm9yX3IvQW5ub3RhdGlvbiAyMDE5LTA4LTIxIDEzMDkyMy5wbmcpDQoNCiMjIyMgSW4gc3RhdGlzdGljcywgYSBjb25mb3VuZGVyIGlzIGEgdmFyaWFibGUgdGhhdCBpbmZsdWVuY2VzIGJvdGggdGhlIGRlcGVuZGVudCB2YXJpYWJsZSBhbmQgaW5kZXBlbmRlbnQgdmFyaWFibGUsIGNhdXNpbmcgYSBzcHVyaW91cyBhc3NvY2lhdGlvbi4gQ29uZm91bmRpbmcgaXMgYSBjYXVzYWwgY29uY2VwdCwgYW5kIGFzIHN1Y2gsIGNhbm5vdCBiZSBkZXNjcmliZWQgaW4gdGVybXMgb2YgY29ycmVsYXRpb25zIG9yIGFzc29jaWF0aW9ucy4gIyMjIw0KDQohW10oaW1hZ2VzX2Zvcl9yL0Fubm90YXRpb24gMjAxOS0wOC0yMSAxMzE3MjMucG5nKQ0KDQohW10oaW1hZ2VzX2Zvcl9yL0Fubm90YXRpb24gMjAxOS0wOC0yMiAxMTAyNTkucG5nKQ0KDQohW10oaW1hZ2VzX2Zvcl9yL0Fubm90YXRpb24gMjAxOS0wOC0yMiAxMTE2NDUucG5nKQ0KDQohW10oaW1hZ2VzX2Zvcl9yL0Fubm90YXRpb24gMjAxOS0wOC0yMiAxMTIxNDMucG5nKQ0KYGBge3J9DQpsaWJyYXJ5KGRwbHlyKQ0KbGlicmFyeShnZ3Bsb3QyKQ0KbGlicmFyeShzdGF0c3IpDQpkYXRhKGFyYnV0aG5vdCkNClZpZXcoYXJidXRobm90KQ0KYGBgDQoNCg0KDQoNCiFbXShpbWFnZXNfZm9yX3IvQW5ub3RhdGlvbiAyMDE5LTA4LTIyIDExNDcxMS5wbmcpDQoNCmBgYHtyfQ0KbGlicmFyeShkcGx5cikNCmxpYnJhcnkoZ2dwbG90MikNCmxpYnJhcnkoc3RhdHNyKQ0KDQpkYXRhKHByZXNlbnQpDQpWaWV3KHByZXNlbnQpDQoNCnByZXNlbnQgPC0gcHJlc2VudCAlPiUNCiAgbXV0YXRlKHRvdGFsID0gYm95cyArIGdpcmxzKQ0KcHJvcF9ib3lzIDwtIHByZXNlbnQkYm95cy9wcmVzZW50JHRvdGFsDQpwcmVzZW50IDwtIHByZXNlbnQgJT4lDQogIG11dGF0ZShwcm9wX2JveXMpDQpnZ3Bsb3QoZGF0YSA9IHByZXNlbnQsIGFlcyh4ID0geWVhciwgeSA9IHByb3BfYm95cykpICsNCiAgZ2VvbV9saW5lKCkgKyBnZW9tX3BvaW50KCkgKyBnZW9tX3Ntb290aCgpDQpwcmVzZW50IDwtIHByZXNlbnQgJT4lDQogIG11dGF0ZShtb3JlX2JveXMgPSBwcmVzZW50JGJveXMgPiBwcmVzZW50JGdpcmxzKQ0KVmlldyhwcmVzZW50KQ0KcHJlc2VudCA8LSBwcmVzZW50ICU+JQ0KICBtdXRhdGUocHJvcF9ib3lfZ2lybCA9IGJveXMgLyBnaXJscykNClZpZXcocHJlc2VudCkNCmdncGxvdChkYXRhID0gcHJlc2VudCwgYWVzKHggPSB5ZWFyLCB5ID0gcHJvcF9ib3lfZ2lybCkpICsNCiAgZ2VvbV9saW5lKCkgKyBnZW9tX3BvaW50KCkgKyBnZW9tX3Ntb290aCgpDQpwcmVzZW50IDwtIHByZXNlbnQgJT4lDQphcnJhbmdlKGRlc2ModG90YWwpKQ0KDQoNCmBgYA0KIyMjIyBSb2J1c3Qgc3RhdGlzdGljcyBhcmUgc3RhdGlzdGljcyB3aXRoIGdvb2QgcGVyZm9ybWFuY2UgZm9yIGRhdGEgZHJhd24gZnJvbSBhIHdpZGUgcmFuZ2Ugb2YgcHJvYmFiaWxpdHkgZGlzdHJpYnV0aW9ucywgZXNwZWNpYWxseSBmb3IgZGlzdHJpYnV0aW9ucyB0aGF0IGFyZSBub3Qgbm9ybWFsLiBSb2J1c3Qgc3RhdGlzdGljYWwgbWV0aG9kcyBoYXZlIGJlZW4gZGV2ZWxvcGVkIGZvciBtYW55IGNvbW1vbiBwcm9ibGVtcywgc3VjaCBhcyBlc3RpbWF0aW5nIGxvY2F0aW9uLCBzY2FsZSwgYW5kIHJlZ3Jlc3Npb24gcGFyYW1ldGVycy4gIyMjIw0KDQojIyMjIFRoZSByZXNwb25zZSB2YXJpYWJsZSBpcyB0aGUgZm9jdXMgb2YgYSBxdWVzdGlvbiBpbiBhIHN0dWR5IG9yIGV4cGVyaW1lbnQuIEFuIGV4cGxhbmF0b3J5IHZhcmlhYmxlIGlzIG9uZSB0aGF0IGV4cGxhaW5zIGNoYW5nZXMgaW4gdGhhdCB2YXJpYWJsZS4gSXQgY2FuIGJlIGFueXRoaW5nIHRoYXQgbWlnaHQgYWZmZWN0IHRoZSByZXNwb25zZSB2YXJpYWJsZS4gTGV04oCZcyBzYXkgeW914oCZcmUgdHJ5aW5nIHRvIGZpZ3VyZSBvdXQgaWYgY2hlbW8gb3IgYW50aS1lc3Ryb2dlbiB0cmVhdG1lbnQgaXMgYmV0dGVyIHByb2NlZHVyZSBmb3IgYnJlYXN0IGNhbmNlciBwYXRpZW50cy4gVGhlIHF1ZXN0aW9uIGlzOiB3aGljaCBwcm9jZWR1cmUgcHJvbG9uZ3MgbGlmZSBtb3JlPyBBbmQgc28gc3Vydml2YWwgdGltZSBpcyB0aGUgcmVzcG9uc2UgdmFyaWFibGUuIFRoZSB0eXBlIG9mIHRoZXJhcHkgZ2l2ZW4gaXMgdGhlIGV4cGxhbmF0b3J5IHZhcmlhYmxlOyBpdCBtYXkgb3IgbWF5IG5vdCBhZmZlY3QgdGhlIHJlc3BvbnNlIHZhcmlhYmxlLiBJbiB0aGlzIGV4YW1wbGUsIHdlIGhhdmUgb25seSBvbmUgZXhwbGFuYXRvcnkgdmFyaWFibGU6IHR5cGUgb2YgdHJlYXRtZW50LiBJbiByZWFsIGxpZmUgeW91IHdvdWxkIGhhdmUgc2V2ZXJhbCBtb3JlIGV4cGxhbmF0b3J5IHZhcmlhYmxlcywgaW5jbHVkaW5nOiBhZ2UsIGhlYWx0aCwgd2VpZ2h0IGFuZCBvdGhlciBsaWZlc3R5bGUgZmFjdG9ycy4gIyMjIw0KDQohW10oaW1hZ2VzX2Zvcl9yL0Fubm90YXRpb24gMjAxOS0wOC0yMiAxNTAzNTUucG5nKQ0KDQohW10oaW1hZ2VzX2Zvcl9yL0Fubm90YXRpb24gMjAxOS0wOC0yMiAxNTA1MjYucG5nKQ0KDQohW10oaW1hZ2VzX2Zvcl9yL0Fubm90YXRpb24gMjAxOS0wOC0yMiAxNTE0NDMucG5nKQ0KDQoNCg0KIVtdKGltYWdlc19mb3Jfci9Bbm5vdGF0aW9uIDIwMTktMDgtMjIgMTUxNTQ1LnBuZykNCg0KIVtdKGltYWdlc19mb3Jfci9Bbm5vdGF0aW9uIDIwMTktMDgtMjIgMTUxOTUxLnBuZykNCg0KDQohW10oaW1hZ2VzX2Zvcl9yL0Fubm90YXRpb24gMjAxOS0wOC0yMiAxNTI3MTEucG5nKQ0KDQohW10oaW1hZ2VzX2Zvcl9yL0Fubm90YXRpb24gMjAxOS0wOC0yMiAxNTQ1NTEucG5nKQ0KDQoNCg0KDQo=